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Abstract 

Despite of the variety of available Web services registries specially aimed at Life Sciences, their scope is usually restricted to 
a limited set of well-defined types of services. While dedicated registries are generally tied to a particular format, general- 
purpose ones are more adherent to standards and usually rely on Web Service Definition Language (WSDL). Although WSDL 
is quite flexible to support common Web services types, its lack of semantic expressiveness led to various initiatives to 
describe Web services via ontology languages. Nevertheless, WSDL 2.0 descriptions gained a standard representation based 
on Web Ontology Language (OWL). BioSWR is a novel Web services registry that provides standard Resource Description 
Framework (RDF) based Web services descriptions along with the traditional WSDL based ones. The registry provides Web- 
based interface for Web services registration, querying and annotation, and is also accessible programmatically via 
Representational State Transfer (REST) API or using a SPARQL Protocol and RDF Query Language. BioSWR server is located at 
><http://inb.bsc.es/BioSWR/and its code is available at ><https://sourceforge.net/projects/bioswr/under the LGPL license. 
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Introduction 

To the extent that the number of Web services available to the 
Life Science community is continuously growing, there is a need to 
provide better ways for their description, categorization and 
discovery. Existing Web services catalogues like EMBRACE [1] or 
BioCatalogue [2] are usually bound to Web Service Definition 
Language (WSDL) and limit themselves to annotation of service 
entities. The need to provide a richer way to describe Web services 
raised an interest for ontology-based models for Web services 
description. A large variety of Web services types and protocols 
either limits the scope of such ontologies to concrete type of 
services or makes them so abstract that still requires WSDL usage. 
In the latter case, a link between WSDL components and their 
semantic descriptions is usually done via Semantic Annotations for 
WSDL and XML Schema (SAWSDL) annotations [3]. 

WSDL 2.0 brought a new conceptual model with considerable 
improvements in Representational State Transfer (REST) Web 
services description. A possibility to describe RESTful Web 
services along with Simple Object Access Protocol (SOAP) based 
ones is a clear step forward especially in life science domain where 
both approaches are intensively used. Another remarkable 
improvement of WSDL 2.0 was the introduction of Internation- 
alized Resource Identifiers (IRIs) for its described components. 
The ability to unambiguously identify every WSDL 2.0 entity as a 
resource allows using these identifiers within ontologies. WSDL 
2.0: RDF Mapping specification provides such ontology to express 



WSDL 2.0 in Web Ontology Language (OWL). The possibility to 
describe Web services using standard semantic vocabularies, does 
not replace the need of ontologies for the life science domain, but 
rather provides a better integration where Web services may be 
represented in pure semantic way. 

BioSWR registry is a response to the need to introduce a 
standard semantic view into Web services in addition to the 
traditional WSDL-based one. OWL/RDF representation of 
service definitions allows using SPARQL Protocol and RDF 
Query Language (SPARQL) for Web services discovery and 
annotation, while WSDL-based representation provides a com- 
patibility with existing Web services development tools. 

The choice of WSDL 2.0 as a basement for Web services 
descriptions was dictated by the need to support a broader range 
of Web services. The required easy bidirectional transformation 
between representational models puts further limitations to the 
choice of the Web services description ontology. Within several 
frameworks and specifications aimed at semantic Web services 
description, W3G Web Services Description Language (WSDL) 
Version 2.0: RDF Mapping specification has been chosen as a 
standard basement for a modern Semantic Web Registry 
implementation. 

Functionality 

BioSWR is designed to support WSDL 1.1/2.0 Web services, 
providing a local storage for the registered services and dependent 
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XML Schema files (Figure 1). While WSDL is generally used to 
describe SOAP and sometimes RESTful Web services, WSDL 2.0 
component model is protocol agnostic and may be adapted to 
virtually any type of services. To extend a number of supported 
Web services, BioSWR provides also support for BioMoby [4] 
services. In the latter case the support is implemented via 
SAWSDL extension, embedding my Grid BioMoby semantic model 
[5] into WSDL 2.0 descriptor. BioSWR also relies on SAWSDL 
for general services annotation, using EMBRACE Data and 
Methods (EDAM) [6] ontology as the primary source of semantic 
annotations. 

RESTful services are becoming very popular due to their 
simplicity of use. For this same reason, unlike SOAP-based web- 
services, they usually lack formal description and are difficult to 
integrate into workflows. To help developers in the description, 
and registration of their RESTful web-services, BioSWR docu- 
mentation provides a simple WSDL 1.1 template. For those 
RESTful web-services that may be accessed via web browser, 
BioSWR also provides a sample URL-based template. Finally, for 
clients requiring more formal descriptions, a WADL description is 
provided along with the stored WSDL. 

BioSWR REST API 

BioSWR provides a REST-based API to manage Web services 
storage (Table 1). The API offers HTTP access to stored Web 
services definitions that can be directly imported into tools like 
Taverna [7]. While new Web service registration may be 
performed by any authenticated user, service removal may be 
accomplished only by a service owner - the user who originally 
registered the service. Web service owner may also allow other 
users to annotate the service, keeping the rights to remove 
inappropriate annotations. Credentials should be provided via 
standard basic HTTP Authentication [8] . 

Semantic data querying 

Instead of providing a custom API for Web services search, 
BioSWR provides SPARQL querying over stored Web services 



descriptions. BioSWR supports SPARQL 1.1 Protocol query 
variations via HTTP GET or HTTP POST bindings (Figure 2). 

To facilitate SPARQL-based repository discovery, all results are 
provided with a wsdli : wsdlLocation property to locate the 
original description document as it was found in the registry. 
SPARQL query may also be used to filter Web services in the Web 
interface, however to provide a friendly interface for non-expert 
users, simple text-based search is available. 

SPARQL UPDATE support provides a simple programmatic 
way to manage semantic annotations such as rdf s : comment and 
SAWSDL references (Figure 3). 

Semantic enrichment 

In accordance with SAWSDL specification, semantic enrich- 
ment is attained via sawsdl rmodelRef erence attributes. The 
choice of an appropriate annotation subject is defined internally as 
logical axioms and realized through semantic reasoning (Figure 4). 
This approach provides flexibility when choosing external 
annotation sources. BioSWR provides EDAM ontology integra- 
tion. EDAM was specially designed for bioinformatics/ computa- 
tional biology domain and provides a wide coverage of common 
bioinformatics objects and methods. 

Apart from SAWSDL references, basic OWL 2 annotation 
properties such as rdf s : comment, rdf s : seeAlso and 
rdf s : isDef inedBy are supported. BioSWR keeps track of all 
annotations, annotating them with rdf s : isDef inedBy (annota- 
tion of another annotation). The latter provides flexibility in 
annotation management, where only authorized authors may 
modify outdated annotations. 

BioMoby integration 

BioMoby has been a widely used framework to deploy general 
and specialized bioinformatics WS. Although BioMoby usage has 
declined over the last years, a large number of services still exist. 
BioSWR provides BioMoby integration through semantically 
enriched WSDL 2.0 descriptions. While BioMoby services are 
SOAP-based they use a special BioMoby message format which 
cannot be expressed in XML Schema. From SOAP point of view 
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Figure 1. BioSWR general architecture. 
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Table 1. BioSWR REST Web services API. 





HTTP location 


HTTP method 


Description 


/service/register?url 
= {url}&lsid = {lsid} 


GET 


Registers the service providing its WSDL file URL or a BioMoby LSID 
identifier. Returns a WSDL service description. 


/service 


GET 


Get an OWL/RDF ontology with all registered services. 


/service/{id} 


GET 


Get a Web service description by its identifier. 


/service/{id} 


DELETE 


Removes a Web service with given identifier. 



doi:10.1371/journal.pone.0107889.t001 



its content constitutes an encoded string. Fortunately, BioMoby 
already provides its own service's descriptions through BioMoby 
my Grid ontology. These definitions are integrated into generated 
BioMoby WSDL 2.0 descriptors and linked with input/ output 
elements via SAWSDL annotations. The tinyMOBY [9] extension 
of tiny WSDL [10] parser provides my Grid semantic annotations 
management within WSDL 2.0 descriptions. tinyMOBY allows to 
extract BioMoby data-type definitions from the embedded my Grid 
ontology providing a direct integration with BioMoby Java API 
[11]. The latter eliminates the need of using BioMoby Registries, 
mostly inactive nowadays, since tinyMOBY datatype definitions 
includes all required information for BioMoby message prepara- 
tion and further Web service execution. 

WADL support 

For WSDL 2.0 services that are described through HTTP 
Binding Extension, Web Application Description Language 
(WADL) descriptors may be also obtained via the BioSWR REST 
API providing an HTTP "Accept: application/vnd.sun.wadl+ 



xml" header. The WADL descriptor may be also found in the 
service description panel of the Web interface. 

Web services monitoring 

One of the most challenging issues in providing any kind of tool 
registry in Bioinformatics is to keep track of their availability. 
There are several levels of Web services monitoring that are 
usually performed to verify Web services operability. BioSWR 
implements an availability check inspecting the original Web 
service description that has been used for the registration. The 
check is performed periodically or upon user request. The absence 
of Web service description is interpreted as service withdrawal. 
Modifications of the original Web service descriptions are detected 
using cyclic redundancy check (CRC) algorithm [12]. An 
indication of the status of the service (active, modified, or 
unavailable) is included in the web interface, and services list 
can be filtered by such parameter. 



PREFIX rdf: <http://www.w3.Org/1999/02/22-rdf-syntax-ns#> 

DESCRIBE ?s WHERE { ?s rdf:type <http://www.w3.Org/ns/wsdl-rdf#Service> . } 



GET/sparql?query=PREFIX+rdf%3A+%3Cht^ 

%23%3E+DESCRIBE+%3Fs+WHERE+%7B+%3Fs+rdf%3Atype+%3Chttp%3A%2F%2Fwww.w3.org%2Fns 
%2Fwsdkdf%23Service%3E++%7D+ 

HTTP/1.1 200 OK 
Content-Type: application/rdf+xml 

<?xml version-' 1 .0" encoding="UTF-8"?> 
<rdf:RDF x m I ns : rdf http://www.w3. org/1 999/02/22-rdf-syntax-ns#" 
xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" 
xmlns:owl="http://www.w3.org/2002/07/owl#" 
xmlns:wsdli="http://www.w3.org/ns/wsdl-instance#" 
xmlns="http://www.w3.org/ns/wsdl-rdf#"> 
<rdf:Description rdf:about="urn:lsid:inb.bsc.es#wsdl.service(runNCBIBIastp)"> 
<rdf :type rdf :resou rce- 'http ://www.w3 .org/2002/07/owl#Named I nd ivid ua I7> 
<rdf:type rdf:resource="http://www.w3.org/ns/wsdl-rdf#Service7> 
<rdfs:label rdf:datatype="http://www.w3.org/2001/XMLSchema#8tring"> 
<wsdli:wsdll_ocation rdf:datatype="http://www.w3.org/2001/XMLSchema#anyURr'> 

.../BioSWR/rest/service/162A77DF4D559C4ECC626D350172CE85 
</wsdli:wsdlLocation> 

<endpoint rdf:resource="urn:lsid:inb.bsc.es#wsdl.endpoint(runNCBIBIastp/runNCBIBIastp)7> 
implements rdf:resource="urn:lsid:inb.bsc.es#wsdl.interface(runNCBIBIastp)7> 
</rdf:Description> 

</rdf:RDF> 

Figure 2. Find all registered Web services via SPARQL DESCRIBE query. 

doi:1 0.1 371 /journal. pone.01 07889.g002 
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PREFIX rdf: <http://www.w3.Org/1999/02/22-rdf-syntax-ns#> 
PREFIX sawsdl: <http://www.w3. org/ns/sawsdl#> 
INSERT DATA { <urn:lsid:inb.bsc.es#wsdl.interface(getEntryFromPDB)> 
sawsdl: model Reference 

'http://example.com' } 

POST /BioSWR/rest/sparql/ HTTP/1.1 
Host: inb.bsc.es 

Content-Type: application/sparql-update; charse!=UTF-8 

PREFIX rdf: <http://www.w3.Org/1999/02/22-rdf-syntax-ns#> 
PREFIX sawsdl: <http://www.w3. org/ns/'sawsdl#> 
INSERT DATA{ <um:lsid:inb.bsc.es#wsdl.interface(getEntryFromPDB)> 
sawsdl:modelReference 'http://example.com' } 

HTTP/1.1 200 OK 
Content-Length: 0 



PREFIX rdf: <http://www.w3.Org/1999/02/22-rdf-syntax-ns#> 

PREFIX sawsdl: <http://www.w3.org/ns/sawsdl#> 

DELETE DATA { <urn:lsid:inb.bsc.es#wsdl.interface(getEntryFromPDB)> 

sawsdl:modelReference 

'http://example.com' AA <http://www.w3.org/2001/XMLSchema#string> } 

POST /BioSWR/rest/sparql/ HTTP/1.1 
Host: inb.bsc.es 

Content-Type: application/sparql-update; charset=UTF-8 

PREFIX rdf: <http://www,w3.org/1999/02/22-rdf-syntax-ns#> 

PREFIX sawsdl: <http://www.w3.0rg/rts/sawsdl#> 

DELETE DATA{ <urn:lsid:inb.bsc.es#wsdl.interface(getEntryFromPDB)> 

sawsdfmodelReference 'http:,' , /ex3mple.com' A ' l <http://www.w3.org/2001/XMLSchema#string> } 

HTTP/1.1 200 OK 
Content-Length: 0 



Figure 3. Add/Remove SAWSDL reference via SPARQL UPDATE query. 

doi:1 0.1 371 /journal. pone.01 07889.g003 



Implementation 

BioSWR is implemented using Java EE 6 Platform. Web 
interface is based on Java Server Faces 2.0 and RichFaces 
component framework. BioSWR REST API is implemented using 
Java API for RESTful Web Services. SPARQL protocol 
implementation is based on openRDF Sesame framework [13]. 
Registered services are stored in a MySQL database. 

Technologies chosen as a ground for the registry became a 
challenging task of implementation of the latest standards in 
Semantic Web services. To achieve BioSWR goals several brand- 
new Java libraries have been developed and contributed to the 
community: 

wsdl2rdf service description library 

Semantic representation of Web services requires a solid tool to 
provide a mapping between WSDL 2.0 and OWL model 
representation. The wsdl2rdf library provides an easy and 
straightforward API for WSDL 2.0 ontology management, hiding 
OWL complexity from developers. As WSDL 2.0 ontology does 
not impose most of the restrictions defined in WSDL 2.0 
specification, the advantage of the API usage over a straight 
ontology manipulation is to provide ontology consistency valida- 
tion. The wsdl2rdf library strictly follows the original ontology 



provided by the WSDL 2.0 RDF Mapping specification [14] and 
is based on The OWL API [15]. 

WSDL 2.0 parsing library 

To parse and manipulate WSDL 2.0 descriptions, a brand new 
WSDL 2.0 library has been developed. The library is based on 
WSDL 2.0 Part 1: Core Language [16] and WSDL 2.0 Part 2: 
Adjuncts specifications [17], and supports both SOAP and HTTP 
binding extensions. A complementary tinyXmlSchema exten- 
sion library has been developed to provide an easy manipulation of 
referenced XML Schema elements. The tinyXmlSchema library is 
based on Apache XML Schema 2.0 library [18]. In addition to 
standard WSDL 2.0 extensions, tinyWSDL supports SAWSDL 
annotations. 

WSDL 2.0 BioMoby extension library 

In order to provide better integration with BioMoby services, 
tinyMOBY extension library has been developed. The library 
allows representing BioMoby services through semantically 
enriched WSDL 2.0 descriptors. Generated descriptors embed 
my Grid BioMoby RDF definitions that can be used to reconstruct 
BioMoby message format. As well as the owl2rdf library, 
tinyMOBY is based on The OWL API. The integration with 
BioMoby is implemented via lightweight BioMoby Java API. 



Prefix: : <http://edamontology.org/> 
Ontology: <http://edamontology.org> 

Class: topic_0078 SubClassOf: topic_0003 
Class: topic_2225 SubClassOf: topic_0078 

13 _ 

Prefix: : <http://edamontology.org/> 

Prefix: wsdl: <http://www.w3.org/ns/wsdl-rdf#> 

Prefix: sawsdl: <http://www.w3.org/ns/sawsdl#> 

Class: wsdl interface SubClassOf: sawsdl#modelReference only topic_0003 



Prefix: wsdl: <http://www.w3.org/ns/wsdl-rdf#> 
Ontology: <http://www.w3.org/ns/wsdl-rdf> 



Class: wsdl: Interface 



Prefix: sawsdl: <http://www.w3.org/ns/sawsdl#> 
Prefix: wsdl-rdf: <http://www.w3.org/ns/wsdl-rdf#> 
Ontology: <urn:lsid:inb.bsc.es> 

Individual: <um:lsid:inb.bsc.es#wsdl.interface(getEntryFromPDB)> Types: wsdl-rdf: Interface 
sawsdhmodelReference "http://edamontology.org/topic_2225" AA xsd:string 

Figure 4. Example of Semantic Rules definitions. Here wsdl: Interface sawsdl#modelReference property is restricted to topic_0003 (Topic). 
Because topic_2225 (Protein databases) is a subclass of topic_0003, urn:lsid:inb.bsc.es#wsdl.interface(getEntryfromPDB) interface (which is an 
individual of wsdl: Interface) may be annotated with it without making the ontology inconsistent. 
doi:1 0.1 371 /journal.pone.01 07889.g004 
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Discussion 

Extending the EMBRACE and BioCatalogue registries philos- 
ophies, BioSWR offers many unique features for Semantic Web 
Services providers and consumers. The most significant advance- 
ment of BioSWR is the adoption of WSDL 2.0 and its standard 
OWL-based representation. Semantic representation of Web 
services allows using SPARQL query language for Web services 
discovery and annotations and greatly simplifies BioSWR REST 
API eliminating a need in respective methods. 

BioMoby was a fairly extended protocol for SOAP-based web 
services, and a significant number of services are still available. 
BioSWR offers semantically enriched WSDL 2.0 descriptors for 
BioMoby services, freeing BioMoby clients from the need to 
interact with a BioMoby Central, and simplifying BioMoby 
services execution. 

BioSWR WSDL 2.0 support also contributes to bringing 
RESTful Web services back to the standards. Being very popular 
to access data from repositories due to its simplicity of use, 
RESTful Web services are often developed outside of the 
established standards. This precludes those services from being 
integrated in bioinformatics, and even makes difficult its usage 
with current clients without a manual adaptation. BioSWR has 
been developed with a special interest in such integration. For 
RESTful Web services BioSWR provides both WSDL and WADL 
descriptions, which can be used by appropriate clients. BioSWR 
also facilitates URL-based templates for RESTful retrieve 
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